policy improvement
Country:
- North America > United States > Washington > King County > Seattle (0.04)
- Asia > China > Liaoning Province > Dalian (0.04)
Country:
- Asia > Middle East > Jordan (0.04)
- South America > Brazil > Rio Grande do Sul (0.04)
- North America > United States > Massachusetts > Middlesex County > Belmont (0.04)
Policy Improvement using Language Feedback Models
First, by using Language Feedback Models (LFMs) to identify desirable behaviour to imitate, we improve task-completion rate over strong behavioural cloning baselines on three distinct language grounding environments (Touchdown, ScienceWorld, and ALFWorld). Second, imitation learning using LFMs outperforms using LLMs as experts to directly predict actions, when controlling for the number of LLM output tokens.
Country:
- North America > United States > New York > New York County > New York City (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Genre:
- Research Report > Experimental Study (1.00)
- Research Report > Strength High (0.93)
Country:
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- North America > United States > Illinois > Cook County > Chicago (0.04)
Country:
- North America > United States > California > Los Angeles County > Long Beach (0.14)
- Europe > Switzerland > Zürich > Zürich (0.14)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)
Country:
- North America > United States > California > Los Angeles County > Long Beach (0.14)
- Europe > Switzerland > Zürich > Zürich (0.14)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.05)
Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)
Country:
- North America > United States > Massachusetts > Middlesex County > Belmont (0.04)
- North America > Canada (0.04)
- Asia > Middle East > Jordan (0.04)